Predicting protein thermostability changes from sequence upon multiple mutations
Abstract
MOTIVATION: A basic question in protein science is to what extent mutations affect protein thermostability. This knowledge would be particularly relevant for engineering thermostable enzymes. Several experimental studies have addressed this issue serendipitously. It would therefore be convenient to provide a computational method that predicts when a given protein mutant is more thermostable than its corresponding wild-type.
RESULTS: We present a new method based on support vector machines that is able to predict whether a set of mutations (including insertions and deletions) can enhance the thermostability of a given protein sequence. When trained and tested on a redundancy-reduced dataset, our predictor achieves 88% accuracy and a correlation coefficient equal to 0.75. Our predictor also correctly classifies 12 out of 14 experimentally characterized protein mutants with enhanced thermostability. Finally, it correctly detects all 11 mutated proteins whose increase in stability temperature exceeds 10 °C.
AVAILABILITY: The dataset and the list of protein clusters adopted for the SVM cross-validation are available at the web site http://lipid.biocomp.unibo.it/~ludovica/thermo-meso-MUT.
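As an illustration of the two figures of merit quoted in the abstract, the following minimal sketch computes accuracy and a correlation coefficient for a binary classifier from confusion-matrix counts. It assumes that the "correlation coefficient" is the Matthews correlation coefficient, which the abstract does not state explicitly. The function names and the example counts are hypothetical and are not taken from the paper.

```python
import math


def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient for a binary classifier.

    Returns 0.0 when any marginal total is zero (a common convention),
    since the coefficient is undefined in that case.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts for illustration only (not the paper's data):
# "positive" = mutant predicted to be more thermostable than wild-type.
tp, tn, fp, fn = 8, 9, 1, 2

print(f"accuracy = {accuracy(tp, tn, fp, fn):.2f}")
print(f"MCC      = {matthews_corrcoef(tp, tn, fp, fn):.2f}")
```

Unlike accuracy, the Matthews coefficient stays informative when the two classes are unbalanced, which may be why both numbers are reported.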
Related articles
PROTS-RF: A Robust Model for Predicting Mutation-Induced Protein Stability Changes
The ability to improve protein thermostability via protein engineering is of great scientific interest and also has significant practical value. In this report we present PROTS-RF, a robust model based on the Random Forest algorithm capable of predicting thermostability changes induced by not only single-, but also double- or multiple-point mutations. The model is built using 41 features includ...
Reliable prediction of protein thermostability change upon double mutation from amino acid sequence
SUMMARY The accurate prediction of protein stability change upon mutation is one of the important issues for protein design. In this work, we have focused on the stability change of double mutations and systematically analyzed the wild-type and mutant residues, patterns in amino acid sequence and locations of mutants. Based on the sequence information of wild-type, mutant and three neighboring ...
Mutations to alter Aspergillus awamori glucoamylase selectivity. IV. Combinations of Asn20-->Cys/Ala27-->Cys, Ser30-->Pro, Gly137-->Ala, 311-4 loop, Ser411-->Ala and Ser436-->Pro.
Six previously constructed and nine newly constructed Aspergillus awamori glucoamylases with multiple mutations made by combining existing single mutations were tested for their ability to produce glucose from maltodextrins. Multiple mutations have cumulative effects on glucose yield, specific activity and thermostability. No general correlation between glucose yield and thermostability was obs...
Predicting Protein Thermostability Upon Mutation Using Molecular Dynamics Timeseries Data
A large number of human diseases result from disruptions to protein structure and function caused by missense mutations. Computational methods are frequently employed to assist in the prediction of protein stability upon mutation. These methods utilize a combination of protein sequence data, protein structure data, empirical energy functions, and physicochemical properties of amino acids. In th...
Predicting protein stability changes from sequences using support vector machines
MOTIVATION The prediction of protein stability change upon mutations is key to understanding protein folding and misfolding. At present, methods are available to predict stability changes only when the atomic structure of the protein is available. Methods addressing the same task starting from the protein sequence are, however, necessary in order to complete genome annotation, especially in rel...